StringDecoder : Buffer to string decoding

更新时间:
2024-03-13

StringDecoder : Buffer to string decoding

The string_decoder module provides an API for decoding Buffer objects into strings in a manner that preserves encoded multi-byte UTF-8 and UTF-16(for future) characters. It can be accessed using:

const { StringDecoder } = require('string_decoder');

The following example shows the basic use of the StringDecoder class.

Example

const { StringDecoder } = require('string_decoder');
const decoder = new StringDecoder('utf8');

const cent = Buffer.from([0xC2, 0xA2]);
console.log(decoder.write(cent));

const euro = Buffer.from([0xE2, 0x82, 0xAC]);
console.log(decoder.write(euro));

When a Buffer instance is written to the StringDecoder instance, an internal buffer is used to ensure that the decoded string does not contain any incomplete multibyte characters. These are held in the buffer until the next call to decoder.write() or until decoder.end() is called.

In the following example, the three UTF-8 encoded bytes of the European Euro symbol () are written over three separate operations:

Example

const { StringDecoder } = require('string_decoder');
const decoder = new StringDecoder('utf8');

decoder.write(Buffer.from([0xE2]));
decoder.write(Buffer.from([0x82]));
console.log(decoder.end(Buffer.from([0xAC])));

Support

The following shows StringDecoder module APIs available for each permissions.

 User ModePrivilege Mode
StringDecoder
decoder.write
decoder.end

StringDecoder Class

new StringDecoder([encoding])

  • encoding {String} The character encoding the StringDecoder will use. default: 'utf8'.
  • Returns: {Object} StringDecoder object.

Creates a new StringDecoder instance.

Example

const decoder = new StringDecoder('utf8');

StringDecoder Object

stringDecoder.write(buffer)

  • buffer {Buffer} | {TypedArray} | {DataView} A Buffer, or TypedArray, or DataView containing the bytes to decode.
  • Returns: {String} decode result.

Returns a decoded string, ensuring that any incomplete multibyte characters at the end of the Buffer, or TypedArray, or DataView are omitted from the returned string and stored in an internal buffer for the next call to decoder.write() or decoder.end().

stringDecoder.end([buffer])

  • buffer {Buffer} | {TypedArray} | {DataView} A Buffer, or TypedArray, or DataView containing the bytes to decode, default: undefined.
  • Returns: {String} decode result.

Returns any remaining input stored in the internal buffer as a string. Bytes representing incomplete UTF-8 and UTF-16 characters will be replaced with substitution characters appropriate for the character encoding.

If the buffer argument is provided, one final call to decoder.write() is performed before returning the remaining input.

文档内容是否对您有所帮助?
有帮助
没帮助